Performance Metrics for Ocean and Air Quality Models on Commodity Linux Platforms

نویسنده

  • George Delic
چکیده

This report examines performance of four models with two compilers on the Intel Pentium 4 Xeon processor using the PAPI performance event library to collect performance counter values. The intent is to identify performance metrics that indicate where performance inhibiting factors occur when the codes execute. Results for operations, instructions, cycles, cache, table lookaside buffer misses, branching instructions, and I/O operations are discussed in detail. Results support the thesis that codes optimized for vector register computer architectures also benefit from commodity cache-based architectures and the compilers that optimize for them. Consequences for Air Quality Models are analyzed by comparing results against two ocean models with a view to potential performance benefits on commodity architectures. Introduction This is a report on a project to evaluate industry standard fortran 90/95 compilers for IA-32 LinuxTM commodity platforms when applied to Air Quality Models (AQM). The goal is to determine the optimal performance and workload though-put achievable with commodity hardware. Only a few AQM’s have been successfully converted to OpenMP (CAMx, [1]) or MPI (CMAQ, [2]) and considerable work remains to be done on others. In exploring the potential for parallelism it has been interesting to discover the problems with serial performance on several AQM codes. For this reason we have searched for more precise metrics of performance as an aid to measuring progress in performance enhancement. The historical analogy is the programming environment on Cray architectures which enabled the development of performance attributes for either individual codes or workloads using hardware performance counters [3,4]. Since commodity processors also have performance counters, software interfaces, such as PAPI [5], may be used to read them. This study applied the PAPI library in understanding what delivered performance is for two AQM’s, ISCST3 [6] and AERMOD [7], and what the optimal achievable performance can be. For the latter, as a base-line, two Ocean models with good vector character have been included. These are used to measure the optimal performance to be expected on commodity hardware with available compiler technology. The intent in this report is to identify performance metrics that show where performance inhibition occurs on the hardware architecture, and not to so suggest what needs to be done

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Windows NT, Linux, and QNX as the basis for cluster systems

Clusters use commodity hardware and software components to provide an environment for highperformance parallel processing. A major issue in the development of a cluster system is the choice of the operating system that will run on each node. We compare three alternatives: Windows NT, Linux, and QNX—a real-time microkernel. The comparison is based on expressive power, performance, and ease-ofuse...

متن کامل

Quality of Service Support for Real-time Storage Systems

The performance and capacity of commodity computer systems have improved drastically in recent years. However, these systems still lack the support for real-time data access, which is required by an increasing number of emerging applications. In this paper we first present several important storagebound real-time applications and classify their Quality of Service (QoS) requirements. We then sur...

متن کامل

Evaluating the performance of Atmosphere-Ocean Global Circulation Models (AOGCM) in simulating temperature variable in Ahwaz and Abadan stations

Climate changes caused by global warming has presented challenges to human society. Studying the Changes of climate variables in the future decades by using output data’s of Atmosphere-Ocean Global Circulation Models (AOGCM) is a way of perusing climate fluctuation in a region. In this study, the focus is on the AOGCM proceeds in simulating of variable temperature in Ahwaz and Abadan stations. ...

متن کامل

Capabilities of data assimilation in correcting sea surface temperature in the Persian Gulf

Predicting the quality of water and air is a particular challenge for forecasting systems that support them. In order to represent the small-scale phenomena, a high-resolution model needs accurate capture of air and sea circulations, significant for forecasting environmental pollution. Data assimilation is one of the state of the art methods to be used for this purpose. Due to the importance of...

متن کامل

Impact of CPU-bound Processes on IP Forwarding of Linux and Windows XP

These days, commodity-off-the-shelf (COTS) hardware and software are used to build high-end and powerful workstations and servers to be deployed in today’s local area networks of private homes and smallto medium-sized business. Typically, these servers are multipurpose and shared running networking functionalities involving IP packet forwarding in addition to other CPU intensive applications. I...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005